Evidence on language model consciousness
lesswrong.com·15h
🪞Metacognition
We are building AI slaves. Alignment through control will fail
utopai.substack.com·1d·
Discuss: Substack
🧭Ethics
Anthropic Research Shows How LLMs Perceive Text via @sejournal, @martinibuster
searchenginejournal.com·2d
🤖AI
Emergent Introspective Awareness in Large Language Models
transformer-circuits.pub·2d
🪞Metacognition
Take Weird Ideas Seriously
notboring.co·2d·
Discuss: Hacker News
🧭Ethics
GenAI Poisoning: How Fewer Than 100 Samples Can Corrupt a Multi-Billion Parameter Model
pub.towardsai.net·1d
🤖AI
LLM-generated text is not testimony
lesswrong.com·4h
💬LLM
How I Learned to Stop Worrying and Love My Shitty Life
thedriftmag.com·1d·
Discuss: Hacker News
👤Psychology
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·1d
🤖AI
Show HN: Why write code if the LLM can just do the thing? (web app experiment)
github.com·1h·
Discuss: Hacker News
🤖AI
“Gender without Children”
marginalrevolution.com·14h
👤Psychology
Heart Mountain
xkcd.com·1d
🏛️Philosophy
Carlo Rovelli’s Radical Perspective on Reality
quantamagazine.org·3d
🏛️Philosophy
Asking Paul Fussell for Writing Advice
lesswrong.com·15h
🏛️Philosophy
Signs of introspection in large language models
anthropic.com·3d
🪞Metacognition
Dating: A mysterious constellation of facts
dynomight.net·2d·
Discuss: Hacker News
👤Psychology
OpenAI updates terms to forbid usage for medical and legal advice
openai.com·20h·
Discuss: Hacker News
🧭Ethics
Reasoning Models Reason Well, Until They Don't
arxiv.org·4d·
Discuss: Hacker News
🤖AI
Freewriting in my head, and overcoming the “twinge of starting”
lesswrong.com·18h
🪞Metacognition
Debugging Despair ~> A bet about Satisfaction and Values
lesswrong.com·1d
👤Psychology